OpenAI offers over $550k for a safety lead to oversee AI model releases, a key role in addressing risks from advanced AI, as per the CEO.....
OpenAI offers a high salary of approximately 3.89 million RMB for a safety lead to assess AI model risks, amid challenges including lawsuits over ChatGPT's impact on mental health.....
New York signs the Responsible AI and Education Act, setting safety standards for advanced AI models. Starting 2027, AI firms with over $500M annual revenue must disclose key information, seen as a response to federal deregulation, marking a step in state-level AI oversight.....
Waymo is testing the integration of Google's Gemini AI into its self-driving taxis to create an in-car AI assistant, designed as a safe, restrained, and highly contextual passenger service engine.....
Multi-agent AI system that can create end-to-end, brand-safe marketing campaigns in minutes.
Quickly and accurately convert images to text, support multiple languages, ensure safety and privacy, and offer a free trial.
A free online tool that can instantly detect and remove AI watermarks and zero-width characters, which is safe and fast.
A powerful AI text humanization tool that transforms AI-generated content into natural, human-like content. Free trial available.
Tencent
-
Input tokens/M
Output tokens/M
Context Length
Alibaba
$1.8
$5.4
16
Anthropic
$105
$525
200
Deepseek
$2
8
32
$6
$18
256
Baichuan
$8
$3.5
$7
4
Moonshot
$10
$30
131
01-ai
dogeater1612
This is a Dutch patient simulation model fine-tuned based on Google's Gemma 2 9B model, specifically designed for medical education scenarios. The model always responds in a fixed JSON structure, simulating a patient 'Maria' who has just completed surgery and is in the recovery process, supporting nursing students to exercise communication and clinical reasoning skills in a safe environment.
DevQuasar
This is a 32B parameter reward model developed by NVIDIA based on the Qwen3 architecture. It is specifically used for reward scoring and principle alignment in reinforcement learning, helping to train AI systems that are safer and more in line with human values.
prithivMLmods
Qwen3-4B-SafeRL is a safety-aligned version of the Qwen3-4B model. It enhances the model's robustness against harmful or adversarial prompts through reinforcement learning training. This version uses a hybrid reward function for optimization, balancing three objectives: safety, usefulness, and minimizing unnecessary rejections.
LL1999
This is a LoRA model trained based on the AI Toolkit by Ostris, specifically designed for text-to-video conversion tasks. The model uses the Safetensors format and supports use on multiple platforms such as ComfyUI and AUTOMATIC1111.
Qwen
Qwen3-4B-SafeRL is a safety-aligned version based on the Qwen3-4B model. It is trained through reinforcement learning and combined with the reward signals of Qwen3Guard-Gen, enhancing the model's robustness against harmful or adversarial prompts. While ensuring safety, it avoids overly simple or evasive rejection behaviors.
yihong1120
A target detection model based on YOLO11, specifically designed for construction site safety monitoring. It can detect various safety hazards such as workers not wearing safety equipment, intrusion into dangerous areas, and equipment approaching hazard sources.
HugoHE
M-Hood is a series of models specifically designed to mitigate the hallucination phenomenon in object detection. Through novel fine-tuning strategies and a revised benchmark dataset, it significantly reduces false alarms on out-of-distribution data and enhances the safety and reliability of object detection systems.
phronetic-ai
Owlet Safety 1 is a multi-label safety event detection model fine-tuned based on Qwen2.5-VL-3B-Instruct. It is specifically designed for safety activity recognition in video surveillance and can simultaneously detect multiple safety-related events such as fires, smoke, falls, and assaults.
oscarstories
LORA is a fine-tuned version of the Mistral Small 24B Instruct model, optimized for child-friendly language generation in educational scenarios. It focuses on providing safe, age-appropriate, and engaging storytelling services for children aged 6 to 12.
DMindAI
DMind-1 is a Web3 expert model built upon Qwen3-32B, optimized for the Web3 ecosystem through supervised instruction fine-tuning and human feedback reinforcement learning, achieving significant improvements in task accuracy, content safety, and expert-level interaction alignment.
AIML-TUDA
QwenGuard is a visual security protection model capable of evaluating images based on provided security policies, outputting safety ratings, safety categories, and evaluation rationale.
QwenGuard-v1.2-3B is a visual security protection model developed based on Qwen/Qwen2.5-VL-3B-Instruct, designed for assessing the safety of image content.
KaraKaraWitch
A pre-trained language model that integrates multiple 70B-parameter-scale models, based on the 'Forgotten Safe Word' foundation model, featuring an unrestrained style and a tendency for novel/story writing.
Ateeqq
This model is specifically fine-tuned for NSFW image classification, categorizing content into three safety-critical classes, suitable for content moderation, safety filtering, and compliant content processing systems.
Eviation
Flex.2-preview is a text-to-image model offering multiple quantized versions, supporting Safetensors and GGUF formats.
meta-llama
Llama Guard 4 is a native multimodal safety classifier with 12 billion parameters, jointly trained on text and multiple images for content safety evaluation of large language model inputs and outputs.
sofiascat
This is a reinforcement learning model based on the PPO algorithm, specifically trained for the LunarLander-v2 environment to safely control lunar landings.
mradermacher
Beaver-7B-v3.0 is a 7B-parameter large language model based on the LLaMA architecture, focusing on safety and human feedback reinforcement learning (RLHF).
TheBlueScrubs
A medical text safety and ethics classifier based on the ModernBERT architecture, specifically designed for evaluating text safety in the medical field
google
ShieldGemma 2 is a model trained on Gemma 3's 4-billion-parameter IT checkpoint for cross-critical-category image safety classification, receiving images and outputting policy-compliant safety labels.
mcp-golang is an unofficial Go implementation of the Model Context Protocol library, supporting the rapid construction of MCP servers and clients, and providing features such as type safety, low code volume, modularity, and two-way communication.
Laravel Vibes is a powerful Laravel package for implementing the Machine Control Protocol (MCP) server, supporting seamless integration of AI agents. It provides features such as a tool registration system, real-time communication (SSE), and API endpoints, and supports automatic discovery and type safety.
Conduit is an MCP server that provides API integration for Phabricator and Phorge, supports HTTP/2 and type safety, and can be run through a Docker or HTTP/SSE server.
CodeSeeker is a code search and transformation tool that combines the functions of ugrep and ast-grep, providing intelligent search and replacement capabilities for AI assistants. It supports multiple search modes, safe replacement, and code refactoring.
MCP as a Judge is a behavioral MCP server that acts as a validation layer between AI coding assistants and LLMs. By enforcing evidence - based research, code quality reviews, and human decision - making intervention, it ensures the generation of safer and higher - quality code.
A TypeScript template for creating MCP servers, offering best practices such as type safety, dependency injection, and a service architecture, supporting tool development and testing.
A PostgreSQL database interaction tool server based on the Model Context Protocol (MCP), supporting high-performance asynchronous operations and transaction safety.
This is an open - source project that includes over 35 pre - built MCP connectors for integrating SaaS tools into AI applications, supporting local development and type - safe configuration.
The task portal system is a self - evolving general problem - solving institution with core components such as logical reasoning, ethical framework, sequential thinking, and meta - framework, which can safely self - evolve and solve complex problems.
Agentek is an extensible TypeScript toolkit designed to simplify complex interactions with EVM blockchains, providing a unified type - safe interface, supporting multi - chain operations and AI integration.
The Trello MCP Server is a server based on the Model Context Protocol, providing tools for interacting with the Trello API, including a complete Trello function interface, modular architecture, and type - safety support.
An MCP server that provides tools for interacting with Trello boards, supporting card and list management and activity tracking, with built-in rate limiting and type safety.
A powerful PostgreSQL MCP server that provides full read and write access permissions, supports transaction management and safety control, and is more powerful than the official read - only version.
A server that provides make functionality through the MCP protocol, allowing LLMs to safely execute targets in Makefiles
An SSH server implementation based on the MCP protocol, providing secure remote access and execution functions, including SQLite database integration and TypeScript type safety support.
An MCP server that converts Postman API collections into type - safe AI tool code
The Exa MCP server is a middleware that connects AI assistants and the Exa Search API, enabling AIs like Claude to safely conduct real-time web searches and obtain structured results.
An MCP server implementation integrating the SearXNG API, providing web search functions and supporting features such as pagination, time filtering, language selection, and safe search.
Malloy MCP server implementation for executing Malloy queries and managing resources, providing functions such as metadata access, error handling, and type safety.
The Brevo MCP Server is a comprehensive marketing automation platform integration service that provides complete Brevo API functionality using the official Node.js SDK, including 8 major tools such as email, SMS, contact management, and marketing campaigns. It supports Smithery deployment and TypeScript type safety.